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Abstract 

We examine the benefits of user cooperation under compute-and-forward. Much Hke in network coding, receivers 
in a compute-and-forward network recover finite-field Hnear combinations of transmitters' messages. Recovery is 
enabled by linear codes: transmitters map messages to a linear codebook, and receivers attempt to decode the 
incoming superposition of signals to an integer combination of codewords. However, the achievable computation 
rates are low if channel gains do not correspond to a suitable linear combination. In response to this challenge, 
we propose a cooperative approach to compute-and-forward. We devise a lattice-coding approach to block Markov 
encoding with which we construct a decode-and-forward style computation strategy. Transmitters broadcast lattice 
codewords, decode each other's messages, and then cooperatively transmit resolution information to aid receivers in 
decoding the integer combinations. Using our strategy, we show that cooperation offers a significant improvement 
Cf^ ' both in the achievable computation rate and in the diversity-multiplexing tradeoff. 

Index Terms 

C/3 ' Cooperative communications, lattice codes, network coding, block Markov encoding, diversity-multiplexing 

tradeoff 

K^ ■ I. Introduction 

in 

0> \ Interference is the primary obstacle to communications over wireless networks. Due to the broadcast nature of 

(^ \ the wireless medium, a transmitter's signal arrives not only at its intended receiver(s), but also at any terminal in the 

cn ' vicinity. This fact has proven to be a formidable challenge. Despite decades of study and a plethora of sophisticated 

z^ . techniques, the capacity of even the two-user interference channel remains unknown in general. 

'— ' \ Most approaches to interference entail the minimization of its effects. A special case of the Han-Kobayashi 

►^ ' scheme [I], in which receivers decode a portion of the interference, was recently shown to achieve rates within 

^ \ one bit of the capacity region of the two-user interference channel fl]. Interference alignment, in which interfering 

d signals are made to lie in a low-dimensional subspace by means of multiple antennas, changing channel conditions, 

or signal-scale techniques, has been shown to provide the optimal degrees of freedom of the interference channel: 

for large signal-to-noise ratios, each transmitter can achieve approximately half the rate possible in the absence of 

interference [3 1-[6|. In a somewhat similar approach, lattice codes are used in the many-to-one interference channel 

to align interference at the signal scale, allowing the receiver to decode the interference as though it came from 

a single receiver ||71. In each of these strategies, the goal is to minimize the effective interference seen by each 

receiver. 

Compute-and-forward IH, also known as physical-layer network coding [91 is an innovative technique which 
exploits, rather than eliminates, interference. Under compute-and-forward, receivers decode finite-field linear com- 
binations of transmitter's messages instead of messages themselves. If enough linearly independent combinations 
are recovered, the individual messages can be recovered further "downstream" in the network. In ||8j| such decoding 
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is enabled by lattice codes. Transmitters send out lattice codewords, noisy linear combinations of which arrive 
at the receivers. Each receiver decodes the incoming signal to an integer combination of the lattice codewords 
corresponding to the desired finite-field linear combination. Since an integer combination of lattice points is itself a 
lattice point, it can be decoded almost as though it were a single incoming signal. Linear combinations of messages 
therefore are often easier to decode than individual messages. 

The strategy proposed in [8] requires a correspondence between the channel gains and the desired integer 
combinations. If the channels do not produce suitable linear combinations of transmitters' signals, the receivers 
cannot easily recover suitable integer combinations of the lattice points. Several solutions to this challenge have 
been proposed. Integer-forcing receivers lITOl . ifTTI . in which linear receivers are chosen to induce integer- valued 
equivalent channels, were developed for compute-and-forward over multiple-input multiple-output (MIMO) chan- 
nels. In ifTll . a number-theoretic approach was developed to address this problem in the high-SNR regime. Using 
techniques from Diophantine approximations, an encoding strategy was proposed that achieves the full degrees of 
freedom. 

We take a different tack. Our approach is based on the observation that, if transmitters were able to encode 
their messages jointly, compute-and-forward would reduce to a multiple-antenna broadcast channel, the capacity of 
which is known iTlBl . While perfect cooperation is infeasible, users can cooperate partially by exploiting another 
consequence of the broadcast nature: transmitters can overhear each other's signals and jointly encode portions of 
their messages. The aim of this paper therefore is to examine the extent to which user cooperation can improve 
the performance of compute-and-forward. 

Our main contribution is a cooperative strategy for compute-and-forward. We develop a lattice-coding instantiation 
of block Markov encoding by decomposing the lattice codebook into two linearly independent, lower-rate constituent 
codes, called the resolution codebook and the vestigial codebook. Transmitters broadcast lattice codewords, after 
which they decode the codewords of other transmitters. They then transmit cooperatively the resolution codewords 
corresponding to the linear combinations desired at the receivers. Receivers employ a variant of sliding-window 
decoding tailored to our lattice decomposition. They decode the resolution information and subtract it from the 
original signal; they then need only to decode the vestigial component of the desired sum of lattice points. This 
strategy allows an improvement in computation rate due to two factors. First, since cooperating transmitters decode 
others' messages, they can jointly encode portions of the linear combinations directly, relaxing the need for receivers 
to recover the messages from separately-encoded signals. Second, the jointly encoded signals combine coherently 
at receivers, resulting in a beamforming gain. 

We also present a high-SNR analysis of our approach. User cooperation naturally lends itself to diversity gains, 
and we show that our approach indeed increases the diversity order under slow Rayleigh fading. We focus on the 
case of a single receiver. We prove inner bounds on the diversity-multiplexing tradeoff (DMT) using a combination 
of cooperative random coding techniques and our cooperative lattice strategy. Random coding achieves full diversity 
but performs poorly at high multiplexing gains, whereas lattice coding falls short of full diversity but maintains 
performance at high multiplexing gains. Between the two coding strategies we establish an achievable DMT whose 
comer points match the upper bound of a multiple-antenna, single output (MISO) system. 

A. Related work 

Compute-and-forward can be viewed as one of several wireless instantiations of network coding. Network coding 
was introduced in |[T4l . where it was shown that network coding achieves the multicast capacity of wireline networks. 
It was later shown that (random) linear network codes are sufficient for multicast IITSI - IITtI . and although linear 
codes are provably insufficient for general wireline networks iTTSl they remain popular due to their simplicity and 



effectiveness. Network coding has been applied to wireless networks by several means. Two information-theoretic 
techniques are the quantize-map-and-forward of |[T9l and the "noisy" network coding of ll20l . in which relays 
compress and re-encode the incoming superposition of signals. These approaches generalize the discrete-valued, 
noiseless combinations of wireline network coding to continuous-valued, noisy combinations over wireless links. For 
multicast networks, they come to within a constant gap of capacity. Finally, lattice techniques similar to compute- 
and-forward have been used for the two-way and multi-way relay channels, again achieving rates within a constant 
gap of capacity ll2TI-ll24l|. 

Lattice codes play a fundamental role in compute-and-forward. Early works on lattice codes Il25l - ll27l showed 
that they are sufficient to achieve capacity for the point-to-point AWGN channel. The performance of lattice codes 
under lattice decoding — in which the receiver quantizes the incoming signal to the nearest lattice point — was studied 
in |[28l . and it was shown in |[29l that lattice decoding achieves capacity. In addition to compute-and-forward, lattice 
codes have seen use in a variety of information-theoretic problems, including source coding Il30l - ll32l . physical-layer 
security Il33l - ll35l . and relay networks Il36l - ll39l . 

Finally, our approach relies heavily on the field of user cooperation. Cooperation was first introduced with the 
relay channel in fi40l . In PTI the relay channel is given a thorough treatment, and the most popular relaying 
strategies — now known as decode-and-forward and compress- and-forward — are presented. More recent work has 
focused on the diversity gains of cooperation Il42l - P7l . showing that cooperating transmitters can obtain diversity 
gains similar to that of multiple-antenna systems. 

B. Notation 

We use bold uppercase letters (e.g. A) to refer to matrices and bold lowercase letters (e.g. x) to refer to column 
vectors. For n x m matrix A, a^ refers to the ith column of A, i.e. A = [ai • • • a^]. We denote subvectors of a 
vector using x[a : b] = {xa,Xa+i, ■ ■ ■ ,Xb)^, where {■)'^ denotes the usual transpose. We use ||-|| for the Euclidean 
norm. Let o denote the element-wise or Hadamard product. Let ¥p denote the finite field of prime characteristic p, 
and let ® and denote addition and (matrix) multiplication, respectively, modulo p; however, we will occasionally 
treat the result of modular arithmetic as a member of the reals according to context. Let [x]^ = max{x, 0} denote 
the positive part of x. Finally, let 
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C„ac(h,P,a) = ^^min^^^^log^l + 

denote the symmetric -rate capacity of the I-user Gaussian multiple-access channel having channel gains h and 
noise variance a"^. 

C. Organization 

In Section In] we present the system model and define the performance metrics used in this paper. In Section HID we 
formally state our main results and provide intuition about their benefits. In Section |IV] we introduce lattice codes 
and present the lattice subspace decomposition used in our block Markov strategy. In Section |V] we present our 
cooperative computation strategies in detail and prove that they achieve the computation rates claimed in Section 
Hm In Section [Vl] we perform a high-SNR analysis of our strategies and prove that they provide the diversity- 
multiplexing gains claimed in Section |lIIJ In Section IVIII we present a few numerical examples to showcase the 
benefits of our approach. Finally, we conclude with Section IVIIII 



II. Preliminaries 

A. System model 

In the cooperative compute-and-forward network, depicted in Figure [T] L transmitters communicate with M < L 
receivers over the wireless medium. Each of the L users has T messages 'Wi{t) G F^, for 1 < t < T. Structurally, 
this network resembles the compound multiple-access channel or, when M = L, the interference channel. However, 
unlike those more traditional networks, here each receiver intends to decode a finite-field linear combinatioiu of 
the transmitters' messages: 

L 

fmW = 0a/^0Wi(i), (1) 

1=1 

for aim £ ^- Let the matrix A = [aim] G Z^^^ describe the functions computed by the receivers. 
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Fig. I. The cooperative compute-and-forward network. L users cooperatively transmit to M receivers, wliich decode the desired linear 
functions. 

We divide transmissions into T + 1 blocks of n channel uses each. At block t, each transmitter / broadcasts a 
signal x;(t) G M", subject to an average power constraint: 

-\\Mt)f<P, 

n 
for some P > 0. The superposition of the transmitters' signals, scaled by channel coefficients and corrupted by 
noise, arrives at each receiver: 

L 

Ym (t) = Y, ^l^^l (*) + n(0 > (2) 

1=1 

where him G K is the channel coefficient from transmitter I to receiver m, and n(t) is a white, unit-variance 
Gaussian random vector. For convenience, we gather the channel coefficients into the matrix H = [him]- 
Each transmitter / also obtains the noisy superposition of the other transmitters' signals: 

L 
zi{t) = ^gi,l^l'{t) + ni{t), (3) 

l'=i 

where gi^ G M is the channel coefficient from transmitter /' to transmitter I, and n/(t) is again white, unit-variance 
Gaussian. Again we stack the channel coefficients into a matrix G = [gi'i] with diagonal elements equal to zero. 
The choice of zero for the diagonal elements implies full-duplex operation, meaning that transmitters can transmit 
and receive simultaneously. We further assume that channel matrices H and G are fixed and known globally among 
the transmitters and receivers. 

'Very precisely, receivers compute any of a sequence of linear combinations since, as we shall see, fc,p — !> cxd as the codeword length 
becomes large. 



We will refer occasionally to the non-cooperative compute-and-forward network, which is identical to the 
cooperative network except that the transmitters have no access to each other's transmissions. We model this 
formally by fixing G as the all-zero matrix. 

B. Computation capacity 

We are interested in the computation capacity of the network. Since the receivers recover functions of incoming 
messages, rather than the messages themselves, the computation capacity is defined somewhat differently than the 
capacity of ordinary channels. We endow each transmitter with an encoder Ei : F^^-^ x M"^^ — )• M"^(^+^). That 
is, the encoder Ei takes as its input the messages w;(t) and the received signals z;(t) and generates as its output 
the codewords x;(t). We impose a causality restriction on £";: the output codeword Xi(t) may depend on received 
signals Zi(s) only for s < t. As usual, the encoding rate is defined as the logarithm of the cardinality of the message 
set divided by the number of channel realizations over which the messages are encoded: 

^ Tlog^dF^I) ^ Tfclog2(p) ^ fclog^b) 
n(r+l) n{T+l) ^ n ' 

where the approximation holds for large T. Note that this is the symmetric rate among all transmitters. 

We endow each receiver with a decoder Dm ■ ]R"^(^+^) — )• F^^^, taking as inputs the received signals ym{t) 
and generating as outputs the estimates fm(i)- Let the absolute probability of error be the probability that any 
receiver makes an incorrect estimate of any of the desired functions: 

Pe = Pr{f„,(i) / f™(t), for any 1 < m < Af, 1 < t < T}. (5) 

We say that a computation rate R is achievable if for any e > there exists a sequence of encoders with encoding 
rate greater than R — e and decoders such that Pe — ^ as n — )• oo. For fixed channel gains H, G, function 
coefficients A, and transmit power P, let i?(H,G,A,P) denote the supremum over all achievable computation 
rates. 

In order to define the computation capacity we need to place limitations on the permissible function coefficients 
A. Otherwise we could choose a trivial coefficient matrix, such as the all-zero matrix, for which the achievable 
computation rate is unbounded. We therefore require that A be a member of the following set: 

^ = {A G Z^""^"^ : rank(A) = M, \/ m 3 I such that a^/ / 0}. (6) 

The first condition ensures that the recovered functions retain as much information as possible about the individual 
transmitters' messages; for L = M it implies that one can recover the individual messages from the recovered 
functions. The second condition, which is redundant for L = M, ensures that each transmitter is represented in the 
recovered messages; the receivers cannot simply ignore a transmitter in order to achieve a higher computation rate. 
Finally, we define the computation capacity as the supremum of achievable rates over the set of permissible 
coefficient matrices: 

C(H, G, P) = sup R{U, G, A, P). (7) 

AeA 

In their seminal work, Nazer and Gastpar developed a computation strategy based on nested lattice codes fF]. It 
achieves the following computation rate: 



i?nc(H,P) = max min 

Ae^ l<m<M 



- log2(l + P \\hmf) - - log2(||a„f + P(||a„f ||h„f - la^h^p)) 



(8) 
The first term in dD corresponds to the power in the received signal, whereas the second term is a penalty determined 



by the gap in the Cauchy-Schwarz inequaUty between h^ and a^- The closer h^ and a^ are to being co-Unear, the 
smaller is the rate penalty. Since the Nazer-Gastpar scheme was designed for a non-cooperative network, the rate 
does not depend on G; nevertheless, it serves as a lower bound on the cooperative computation capacity C(H, G). 
In the sequel we present a cooperative computation strategy based in part on the Nazer-Gastpar scheme. 

C. Diversity-multiplexing tradeoff 

One advantage of user cooperation is that cooperating transmitters can achieve performance similar to that of 
a multiple-antenna transmitter. Multiple antennas can improve performance on two fronts: increased reliability in 
the presence of slow channel fading, and increased throughput. In the high-SNR regime, the diversity-multiplexing 
tradeoff quantifies this improvement |l48l. Let the elements of H and G be identically and independently dis- 
tributed according to a Rayleigh distribution. Next, suppose there is a scheme that achieves the computation rate 
i?scheme(H, G, P). Then, the diversity order at multiplexing gain r is defined as 

logPr{fischeme(H,G,P)<§log(P)} 

d{r) = hm ^ . (9) 

P-5>oo logP 

In other words, d{r) is the exponent of the outage probability, with the rate taken to have multiplexing gain r, as 
the SNR goes to infinity. The diversity-multiplexing tradeoff of the system, denoted by d*{r), is the supremum of 
d{r) over all possible schemes. 

The multiplexing gain for compute-and-forward is studied in lfT2ll . There it is shown that, using the Nazer-Gastpar 
approach, the multiplexing gain can be no higher than -j-^. In other words, d{r) = for r > j^ for this scheme. 
In the sequel we show that we can achieve a better diversity-multiplexing tradeoff, including a multiplexing gain 
of unity, for both cooperative and non-cooperative networks having a single receiver. 

III. Main Results 
A. Upper bounds 

First we present two upper bounds on the cooperative computation rate, the proofs of which are contained in 
the Appendix [A] We obtain the first bound by supposing that the transmitters are capable of perfect cooperation, 
which is equivalent to having a genie supply all messages to each transmitter. The problem then reduces to a 
multiple-input, single-output (MISO) broadcast channel, the capacity of which is known lfT3]| . In the sequel we use 
this result to bound the diversity-multiplexing tradeoff. 

Theorem 1: Let the capacity region of a Gaussian MISO broadcast channel be denoted by 
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where conv{-} is the convex hull, 11 is the set of permutations from {1, • • • L} to itself, and V^ is a collection of 
positive semi-definite matrices such that X]m=i t^(^m) < ^P- Then the computation capacity of the cooperative 
compute-and-forward network is bounded above by 

C{U,G,P)<R+^.JU,P), (11) 

where 

<i,„(H, P) = sup{r : rl E C^i,o(H, P)} (12) 

is the symmetric -rate capacity of the Gaussian MISO broadcast channel. 



We obtain the next bound by supposing a genie supplies to the receivers all messages except for those of a single 
transmitter I. Then the receivers need only to recover the messages of transmitter / in order to compute any suitable 
set of functions. This converts the system to a compound relay channel in which the other transmitters serve as 
dedicated relays; we bound the capacity of this channel using cut-set arguments. This upper bound is somewhat 
more realistic than -R^jsq, and we use it in Section IVIII for comparisons to our achievable rates. 

Theorem 2: For each transmitter 1 < / < L, let S^ = {1, • • • , / — 1, Z + 1, • • • , /} be the set of transmitters other 
than transmitter /. Then the computation capacity of the cooperative compute- and-forward network is bounded 
above by 

C(H,G,P)<i?+^gi^(H,G,P), 

where 

-^sinKlc(H,G,P) = rnin max min max min/(x;, 3:5; y^, 2:50 jx^c), (13) 

1<KI, Ae^m,a;„^0 p(x) S&Si 

where p(x) is any distribution over the transmitted signals (xi, • • • xl)^ satisfying the input power constraint. 

B. Achievable rates 

Here we present the computation rates achieved by our cooperative strategy. Our approach is decode-and- 
forward in nature: at one block transmitters send out lattice codewords corresponding to their individual messages; 
these messages are decoded by other transmitters. At the next block transmitters cooperatively encode resolution 
information to assist the receivers. As with any decode-and-forward strategy, we must contend with the fact that it 
may be difficult for transmitters to decode each other's messages. We therefore require only some of the transmitters 
to cooperateo A subset B of the transmitters decodes the messages of every other user, after which they cooperatively 
transmit resolution information to the receivers. Transmitters not in B, not having decoded incoming messages, do 
not send any resolution information. We present the details of this strategy, as well as an achievability proof, in 
Section |Vl 

Theorem 3: Let ^ C {1, • • • ,L}. In the cooperative compute-and-forward network, the following computation 
rate is achievable: 

Rc('H.,G,P) =maxmin| minCmac(a[l : 1-1,1 + 1: L]ovo[l -.1-1,1 + 1 :L],P,1), 

mm <^ - logo 1 + — + 



\<m<M I 2 V 1 + Imr 



-log2(||PhmO Voll +/m,i>) - -log2 ( ||am|| {1 + Im,v) + P {\\Sim\\ ||hm o Vo|| -|a^(hmOVo)| 



(14) 



where 



I^,^ = P ||h„,ovof+ Y^ Ih^v^'lM (15) 

\ m'y^m,0 J 

is the interference power seen at receiver m as it decodes its resolution information, 

lm,v = P Yl |hmV„^'|' (16) 

^Other approaches are possible. For example, in an earlier work [49:| we partitioned the set of transmitters into clusters; transmitters would 
decode only in-cluster messages. In the interests of brevity we discuss only the approach presented in Theorem |3] 



is the interference seen at receiver m as it decodes the vestigial information, and for any vectors vq , vi , • • • , vm 
such that 

M 

5^ \vim\^ < lyi (17) 

m=0 

and Vim = for I y^ B and ttt, > 0. 

The achievable rate (IT4] | is a bit difficult to parse, so we take time here to describe each of its three components. 
First is the rate of a Gaussian multiple-access channel, which corresponds to the rate at which cooperating 
transmitters can decode others' messages. Second is the rate at which each receiver can decode the resolution 
information, which is that of a virtual MISO link between cooperating transmitters and the receiver; signals unrelated 
to the resolution information are treated as noise. Third is the rate at which the receivers, having already decoded 
the resolution information, can decode the vestigial component of the desired combination of lattice points; this is 
the Nazer-Gastpar rate of (JS), with resolution information intended for other receivers treated as noise. 

Each transmitter splits its power between sending its own lattice codewords and cooperatively sending resolution 
information. The split is defined by the steering vectors vq, vi, • • • , vm- Each element vio dictates the fraction of 
power transmitter / expends on its own lattice codewords. For cooperating transmitter /, each element vi^, dictates 
the fraction of power expended on resolution information for receiver m. The steering vectors introduce two separate 
notions of alignment. First, we can choose vq in order to minimize the Cauchy-Schwarz penalty in (IT4] |. Second, we 
can choose the remaining vectors v^ to trade off between increasing the coherence gain at the intended receivers 
and decreasing the interference generated at other receivers. Finding the optimum steering vectors is a non-covex 
problem; for further results and in our simulations we rely on a few heuristic means for selecting them. 

We can obtain a simpler expression for the achievable rate by choosing B = {!,■ ■ ■ , L} and taking the steering 
vectors vi, • • • , v^v/ to be zero-forcing beamformers. Thus the cooperative signals do not interfere at other receivers. 

Corollary 1: The following computation rate is achievable for the cooperative compute-and-forward network: 
i?zf (H, G, P) = maxmin | min Cmac(a[l : / - 1, / + 1 : L] o vo[l : / - 1, / + 1 : L], P, 1), 



mm 

l<m<M 



^ log2(l + P(||h^ O Vof + |h^V,^|2))- 



2 
for any vectors vq, vi, • • • , vj\f satisfying 

and 



- log2 ( lla^ir + P ( llamT l|hm ° voll^ - |a^(hm o vq) 



(18) 



M 



Yl \vim\^ < 1 (19) 



m=0 



v^hm' = 0,V m 7^ m' . (20) 

Since L > M, it is possible to choose non-trivial zero-forcing beamforming vectors for almost every H. 

Finally, choosing ;B = 0, we obtain an achievable rate for both the cooperative and non-cooperative compute- 
and-forward network. This yields a rate similar to dS], except that each transmitter can adjust its transmit power in 
order to tune the effective channels to match the desired linear functions. In fact this rate is a special case of the 
"superposition" compute-and-forward presented in |l8] Theorem 13]. 

Corollary 2: In both the non-cooperative compute-and-forward network and the cooperative compute-and-forward 



network, the following rate is achievable: 



-R(H, G, P) = max min 

AeAl<m<M 



^Iog2(l+P(||h^ovof))- 



-logsHlamll +P(||am|| llhmovoll -|a^(hmovo)| 



, (21) 



for any vq satisfying 

\vm\'^ < l,y 1 < I < L. (22) 

C. Diversity-multiplexing tradeoff 

Here we present our diversity-multiplexing tradeoff results, the proofs of which are presented in Section |Vl] We 
begin with the non-cooperative case. 

Theorem 4: For the non-cooperative compute-and-forward network, the diversity-multiplexing tradeoff for any 
scheme is upper-bounded as follows: 

d*(r) <d+(r) = l-r. (23) 

For the case of M = 1, d*{r) = d'^^{r). 

In other words, the DMT of the non-cooperative compute-and-forward network is bounded above by that of a scalar 
Gaussian channel. In the case of a single receiver, we can achieve this upper bound with lattice codes and signal 
alignment. With the steering vector vq chosen such that the equivalent channel vector is a constant, the achievable 
rate — and therefore the error probability — is approximately that of a single SISO link. 

Next, we look at the DMT of the cooperative compute-and-forward network. We start by presenting an upper 
bound. 

Theorem 5: For the cooperative compute-and-forward network, the diversity-multiplexing tradeoff is upper- 
bounded as 

d*{r)<dt{r) = L{l-r). (24) 

In other words, the DMT is upper-bounded by that of a single MISO link. In the case of a single receiver, we 
derive two lower bounds on the DMT. The first is derived using a rather simple strategy employing time sharing 
and Gaussian codes. 

Theorem 6: For the cooperative compute-and-forward network, the following diversity-multiplexing tradeoff is 
achievable: 

'^random (0 = ^min{l - 2r, (L - 1)(1 - 2(L - l)r)}. (25) 

In particular, d-„dom(0) = ^■ 

Since it involves time sharing, the strategy used in Theorem [6]has poor multiplexing performance. It does, however, 
achieve the full diversity gain of L. The second bound is derived using the cooperative computation strategy of 
Theorem |3] 

Theorem 7: For the cooperative compute-and-forward network, the following diversity-multiplexing tradeoff is 
achievable: 

Cttice(0 = 1 - r + min{[l - 2r] + , [{L - 1)(1 - rL)] + ] + 



max (L - 2) min{[l -x- r] + , [{L - 1)(1 - (L - l)r - x)] + , [x - r] + }. (26) 

0<a;<l 



Here,dr,,,.,,(0) = 2 + ^. 



10 

Here the main difficulty is tlie Cauchy-Schwarz penalty inherent to lattice coding. It turns out that choosing 
vq to align with the channels, as we did in the non-cooperative case, precludes cooperation with high probability. 
We therefore choose vq to be constant, taking the Cauchy-Schwarz penalty "on the chin." We balance the transmit 
power between sending fresh information, which helps transmitters decode others' messages, and sending resolution 
information, which helps the receiver decode the desired Unear combination. Choosing the balance properly, the 
benefits of cooperation outweigh the Cauchy-Schwarz penalty, but only enough to obtain a diversity gain of 
approximately 1/2 per transmitter. Nevertheless, for higher multiplexing gains lattice coding outperforms the strategy 
of Theorem |6] 

We plot the DMT bounds in Figure |2] For L = 2 lattice coding is sufficient to achieve full diversity, and the DMT 
achieved by lattice coding strictly dominates that achieved by random coding. For L > 2, lattice coding achieves 
better performance only for sufficiently high multiplexing gain. Random coding fails altogether at multiplexing 
gains higher than (L — l)/2 due to the need for transmitters to decode L — 1 separate messages and the need for 
time-sharing. Lattice coding, on the other hand, maintains non-zero diversity for every any < r < 1. Between 
the two strategies we obtain the corner points of the DMT region. 
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Fig. 2. Diversity-multiplexing tradeoff for L = 2, L = 5 transmitters and a single reciever 



IV. Lattice codes 

In this section we first introduce the basics of lattice codes, after which we present the lattice decomposition 
employed in our cooperative computation strategy. 



A. Capacity-achieving lattice codes 

Formally, a lattice A is a discrete additive subgroup of M", which implies that for any Ai,A2 G A we have 
Ai + A2 € A and Ai — A2 G A. Any lattice can be generated by taking integer combinations of (not necessarily 
unique) basis vectors. Choosing these basis vectors as columns, we form the generator matrix of A, denoted by 
G G W^"^: 



A = GZ". 



(27) 
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We let Qa denote the lattice quantizer, which maps any point x G R" to the nearest point in A: 

(3a(x) = argmin llx — All . (28) 

AeA 

The lattice A induces a partition of M" into the Voronoi regions V(A) of each lattice point A G A: 

V(A) = {x G M" : Qa(x) = A}, (29) 

where ties are broken arbitrarily. In other words, the Voronoi region of A G A is simply the set of points that are 
closer to A than to any other lattice point. 

Let V = V(0) be the fundamental Voronoi region of A. The mod operation with respect to A returns the 
quantization error 

X mod A = X — Qa(x), (30) 

which is always a member of V. The mod operation allows one to draw an analogy with modulo arithmetic over 
a finite field. Just as modulo arithmetic ensures that the result remains a member of the finite field, performing 
arithmetic modulo A "wraps" the result within V. The mod operation obeys the associativity property: 

[[x] mod A + y] mod A = [x + y] mod A. (31) 

The second moment o"^(A) quantifies the average power of a random variable uniformly distributed inside V: 

where Vol (A) is the volume of a set A C M". The normalized second moment is defined as: 

G(A) = ^. (33) 

Vol(V)" 

The normalized second moment provides a measure of the efficiency of A as a shaping region. The closer V is to 
being spherical, the smaller G(A) is. 

The covering radius rcov(A) is the radius of the smallest sphere that covers V: 

reov(A) = inf{r > 0|V C rBn], (34) 

r 

where Bn is the unit sphere in M". The effective radius refj(A) be the radius of a sphere with the same volume as 

Note that rcov(A) > reflf(A). 

In order to construct lattice codebooks suitable for proving information-theoretic results, we require sequences 
of lattices that asymptotically satisfy several desirable properties. For example, we say that a sequence of lattices 
{A*^")}, A(") G M", is good for covering or Rogers good |[50l provided the covering radius approaches the effective 
radius: 

lim TTT-TT = 1- 

Similarly, a sequence of lattices is good for quantization provided 

lim G(A(")) = — . 
n— >oo 27re 
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Finally, let z ~ A/'(0,o"^I) be a Gaussian random vector. Define the volume-to-noise ratio /i(A, Pg) as 

(Vol(V))^ 



/i(A,Pe 



a2 



where cj^ is chosen such that Pr{z ^ V} = Pe- A sequence of lattices A^") is good for AWGN coding or Poltyrev 
good if 

lim /i(A("),Pe) = 27re. 

The existence of such sequences was proven by Poltyrev in [51] . Furthermore, Erez et al. proved that there exist 
sequences of lattices that are simultaneously good for covering, quantization, and AWGN coding |[52l . 

Lattice codebooks are constructed using nested lattices, as depicted in Figure |3] Here we review the construction of 
codebooks sufficient to achieve capacity for the AWGN point-to-point channel, which is the model for codebooks 
to be used throughout this paper. Let A^ be a sequence of shaping lattices that are good for covering and 
AWGN coding and satisfy a'^i^Kl' ) = 1, and let G^ denote generator matrices for each lattice in the sequence. 
Then, following 1531 . we adapt Construction A ||28l to construct a sequence of coding lattices Ac D As . The 
construction process goes as follows: 

1) For each n, choose an integer k and a prime p. Draw a n x k matrix Fc G Fp^'^ randomly and uniformly. 

(n) 

2) Construct the linear codebook over ¥p defined by Fc : 

3) "Lift" the codebook C^"-'^ to W by defining the lattice 

4) Finally, rotate Ac so that it is nested inside A^ : 

a(") = g(")a("). 

We form the lattice codebook by taking the intersection of the coding lattice with the fundamental Voronoi region 
of the shaping lattice: 

cW =A(")nvi"^ 

The rate of this codebook is 

P=llog,|CW| = ^i^^^^. 
n n 

It is shown in |f8l that choosing p such that n/p — ;■ as n — )■ cxd guarantees that the sequence of coding lattices Ac -^ 
is good for AWGN coding. For any desired rate i? > 0, we can construct an appropriate sequence of codebooks 
by choosing p = nlog2{n) and k = Li^f^J- 

In essence, the preceding codebook construction allows us to take a linear block code over ¥p and to create 
a corresponding Linear code over Euclidean space. If the underlying linear code achieves capacity, as does the 
ensemble of random linear codes, so too does the resulting lattice codebook. We can use any linear code in place 
of the one chosen above; the performance cost is only the gap to capacity of the linear code chosen. 

For the lattice compute-and-forward proposed in [8|, an important fact is that there exists a mapping from 
finite-field messages to lattice codewords that preserves linearity. That is, the mapping sends finite-field linear 
combinations of messages to integer sums of lattice points modulo the shaping lattice. Formally, this implies that 
there is an isomorphism between the additive group of field elements and the group of lattice codewords modulo 
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Fig. 3. Nested lattice codes. White dots are elements of the coding lattice, and black dots are elements of the shaping lattice. Each lattice 
point inside the shaded Voronoi region Vs is a member of the codebook. 

the shaping lattice. We state this result in the following lemma. 
Lemma 1: There exists an isomorphism (/>: Fp —)• C *-"•*, namely 

cj){w) = [G("V~^F(")w] mod Ai"). (36) 

Proof: We need to show that (/> is a bijection and that it respects the group operation; that is, 0(wi © W2) = 
[</>(wi) + <^(w2)] mod As for any wi, W2 G F^. That is a bijection was shown in HI Lemma 5]. To see that 
(f) respects the group operation, we resort to direct computation: 

(/.(wi © W2) = [G("V"^F(") (wi © W2)] mod A(") (37) 

= [G(" V (F(") (wi + W2) + pi)] mod Ai") , (38) 

where i G Z" is a vector of integers corresponding to the discrepancy between real-valued and modulo-p arithmetic. 
Continuing, we get 

0(wi © W2) = [Gi")p-^F(")(wi + W2) + G(")i] mod A^") (39) 

= [G(")p-iF(")(wi + W2)] mod A(") (40) 

= [0(wi) + ,/.(w2)] mod A("). (41) 

where the last equality is due to the fact that Gg i G A^ and that adding a member of Ag does not change the 
result of the arithmetic modulo A^ . ■ 

B. Lattice subspaces 

In our cooperative computation strategy, we tailor block Markov encoding to lattice codes. To do so, we introduce 
a key ingredient of our approach: the decomposition of the lattice codebook into subspaces. Let kr < k, and let 
Fr G Fp^^'' denote the matrix composed of the first kr columns of Fc . Similarly, let k^ = k — kr, and let 
fI," G Y^^^^ denote the matrix of the remaining k^ columns. Then define the resolution lattice A^ and the 
vestigialiij lattice Ay as 

A(") = G(")(p-iF(")F^- + Z'^) 
A^,'^) = G(")(p-iF,[,")F^" +Z'^). 



This terminology is intended to convey the fact that this lattice component encodes the "residual" or "leftover" information bits. We use 
this less-common synonym in order to minimize notational confusion. 
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Since these sequences of lattices are special cases of the lattice construction from the previous subsection, each 
sequence is individually good for AWGN codin 
Define the resolution and vestigial codebooks 



sequence is individually good for AWGN coding. By construction Ac = A^ ■'+Ai,"^ and A^ ^ C Ar ,A^' C Ac 



having rates 



4") = a(") n v^(,o 

rin) ^ A (n) PI y 



Rr = — log2 P 

n 

Rv = — log2 p. 
n 



log2 (P) - 



to achieve the 



By construction Rj. + Ry = Re- Furthermore, for any < i?,. < R, we can choose k^ = [ 

desired resolution codebook rate. For any message w € F^, we can define the projection onto the resolution and 

vestigial codebook as follows: 



(^^(w) = [GsP ^Frw[l : kr]] mod A^ 
(j)v{w) = [GsP^'^Fyw[kr + 1 : A;]] mod A^. 

Using these projections, we can define a linear decomposition of the lattice codebook, as depicted in Figure |4l 
Lemma 2: For any w €¥p, 

0(w) = [0, ( w) + (/>. (w)] mod A(") , (42) 

Proof: This result follows from Lemma [T] By definition w = (w^[l : kr]Ol )'^ (0^ ■w'^[kr + 1 : k])"^, so 

0(w) = <A((w^[l : krlOlf e iOl,w^[kr + 1 : k]f) 

= [0((w^[l : kr],Olf) + ^{iOl,^''[kr + 1 : k]f)] mod A^ 
= [M^) + M^)] modA("), 

(n) (n) 

where the last equality follows from the definition of Fr and Fi, ; zeroing out the unwanted portions of w is 
equivalent to discarding the associated columns of F^"-'. ■ 





,(n) 



,(n) 



Fig. 4. Lattice subspace decomposition. Each lattice codeword in C^"' is the sum of a point in Ci"' (left) and a point in Ci"' (right). The 
shaded region Vs defines the codebook, whereas the strip-shaped Voronoi regions Vr and Vv define the decoding regions of the resolution 
and vestigial codebooks, respectively. 
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The codeword 0(w) G C^") is therefore the sum of two Unearly independent lattice points: (/)r(w), which we 
call the resolution information and which encodes the first kr log2P bits of the message, and 0t,(w), which we call 
the vestigial information and which encodes the remaining k^ log2 P bits. Furthermore, the decomposition is linear 
in the sense that the decomposition of sums of lattice points is the same as the sum of decompositions. 

Lemma 3: Let wi and W2 be messages in F^', and let w = wi © W2. Then 

(PrM = [M^i) + 0r(w2)] mod A^") , (43) 

and 

0.(w) = IM^i) + M^2)] mod A("). (44) 

Proof: This follows directly from the fact that is an isomorphism: 

</'r(w) = (^r(wi © W2) (45) 

= ,^(wi[l : A;,.]©W2[1: A:^]) (46) 

= [0(wi[l : kr]) + 0(w2[l : kr])] mod Ai") (47) 

= [M^i) + M'^2)] mod A("). (48) 

A similar argument holds for (/>„. ■ 

The preceding decomposition permits a lattice-coding instantiation of block Markov encoding. After the trans- 
mission of a lattice codeword, cooperating users can transmit the lower-rate resolution component. The intended 
receiver first decodes the resolution component and subtracts it from the received signal; the receiver then needs 
only to decode the lower-rate vestigial component. Although in this paper we apply the technique to compute- 
and-forward, it can be applied to a variety of relay systems. For example, in a related work [39] an innovative 
lattice list decoding technique is proposed to show that lattice codes can achieve the capacity of the physically 
degraded three-terminal relay channel. Using our technique, it is straightforward to construct an alternative strategy 
that establishes the same result. 

V. Cooperative Computation: Encoding Strategy and Achievable Rate 

In this section we describe our cooperative computation strategy and derive its achievable rate, which amounts 
to a proof of Theorem |3] Our approach is based on the lattice decomposition described in Section IIV-BI Messages 
are communicated in two stages. In the first stage, each transmitter sends the lattice codeword corresponding to 
its message; this codeword is decoded by a subset of the other transmitters. In the second stage, transmitters 
cooperatively transmit the resolution component of the linear combinations desired at the receivers. Similarly, 
receivers decode in two stages. They first decode the resolution component, which they then subtract away from 
the received signal. Then they need only to decode the vestigial component, which is easier to decode due to its 
lower rate. 

Proof of Theorem \3} Our proof goes in three parts: a description of the encoding scheme, a description of 
the decoding scheme, and an analysis of the probability of error 

Encoding: Each transmitter employs identical lattice codebooksQ C having rate Re- The codebook C decomposes 
into resolution and vestigial codebooks Cr and Cy which have respective rates Rr and R.^. As noted in Section 
TV-B[ we have R^ = Rr + Ry 

''Technically we have a sequence of lattice codebooks indexed by n. In the interest of notational simplicity we drop the superscripts. 
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Transmitters encode their T messages over T + 1 blocks as depicted in Table H At block t, each transmitter I 
has a message w;(t), which it encodes by mapping it to the corresponding codeword in C: 

Xi{t) = 4>{Mt))- (49) 

By Lemma |2l each lattice codeword can be decomposed by projecting onto the resolution and vestigial codebooks: 

Xr,l{t) = 0r(w;(t)) 

When transmitting the lattice codeword, each user dithers the lattice point over the shaping region; this ensures 
that the codebook looks approximately Gaussian as well as makes the codewords of each transmitter statistically 
independeno We therefore define the effective codeword 

ci{t) = [Xi{t)+ti{t)]modAs, (50) 

where t;(t) is a dither drawn randomly and uniformly over Vs, independent for each 1 < I < L and 1 < t < T. 
Each receiver m intends to recover the finite-field linear combination fm(i) = ©/=i aim^m{t), which corresponds 
to the lattice point 

r ^ 

A^(i) = (/)(f(i)) = ^aimXiit) mod A,. (51) 

.1=1 

As with the individual codewords, we can decompose Xm{t) into resolution and vestigial components: 

Ar,m(t) = 0r(fm(t)) (52) 

X^,m{t) = Mim{t)). (53) 

The transmitters in B will cooperatively transmit Xr^mii) to each receiver, again dithering the lattice point over V^. 
The effective codeword is 

Cr,m = lK,m{t) + Sm{t)] mod A^, (54) 

where, similar to before, Sm(i) is a dither drawn uniformly over V^ and independent for each 1 < m < M, and 
1 < t < T. 
At block t = 1, each transmitter simply sends its own lattice codeword: 

xKl) = VPvioCiit). (55) 

For subsequent blocks 2 < t <T, each transmitter in B sends a combination of "fresh" information corresponding 
to its own message w/(t) and resolution information corresponding to the messages sent in the previous time slot. 
Suppose that each transmitter in B has successfully decoded A/'(t — 1) for each /' ^ I. Then each transmitter in B 
can construct every Xm{t) and, by extension, every Xr,m{t)- Every transmitter sends its own lattice codeword, and 
transmitters in B send the resolution components for each receiver: 



VP (vioCiit) + E^=i VlmCr,m{t " 1) I , for I G B 

^ ^ ■ (56) 

\/PviQCi{t), otherwise 



Finally, at block t = T + I there is no new fresh information for the transmitters to send. Each transmitter in B 



For further discussion of the need for dithers, see 
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sends only the resolution information corresponding to block T, and the other transmitters send nothing: 

^Em=l «^n^C,,b(,)^(^), for / G S 



x^r + i) 



Otherwise 



(57) 



TABLE I 
Superposition Block Markov encoding for Theorem[3] 



t = 1 



^'lOCl(2) + Yjm=l '^lmCr,m(l) 
W20C2(2) 



t = T+l 



^70" 



Xi(t),lGe 
X2(t),2^i3 

XL(t),i.Gi3 



i'ioci(l) 

1'20C2(1) 







Af 



VL0Cl{2) + Y.m=l VLmCr,m{'\-) 



rM 



2—tm=l ^Lm^r,my^ ) 



Note that, since A^ has normalized second moment equal to unity, and since the dithers are independently and 
uniformly drawn from Vs, we have with high probability 



1 „ 

- X; 
n 



M 



P Yl ^fm < P- 



(58) 



m=0 



Thus the transmit signals obey the average power constraint. 

Decoding: Decoding proceeds in three stages. Each transmitter decodes the messages of every other transmitter, 
the receivers decode the resolution information send cooperatively by the clusters, and finally the receivers decode 
the vestigial information. Having decoded both components of the desired lattice point, the receiver can recover 
the desired linear function. 

At block t = I each transmitter receives the superposition of all the other transmitters' signals, scaled by channel 
gains and corrupted by noise: 



zi{l) = VP^vvogi'iCi'il) + ni{t). 



(59) 



Each transmitter forms estimates wi'i{l) for every /' 7^ / via typical sequence decoding: if there is a unique 
collection of messages jointly typical with the received signal, that collection is taken as the estimate; otherwise 
an error is declared. Note that in this case the transmitters do not employ lattice decoding. 

For blocks 2 < t <T the situation is similar. Each transmitter receives the superposition of other transmitters' 
signals, but in this case the received signals also contain resolution infoiTnation: 



M 



Zi{t) = \/P j Ygi'lVi,QCi>{t) + ^ ^ gi'lVl'mCr,m{t " 1) ) + n/(t). 



(60) 



Supposing that each transmitter has successfully decoded the messages from block t — 1, it knows the resolution 
information. It therefore can subtract this component out, resulting in the effective signal 

M 
A{t) = ^KO - ^^ Xl XI 9l'lVl'mCr,m{t - 1) (61) 



l'el3m=l 

P^gi'iVvQCvit) + ni{t) 



(62) 
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Now, just as for t = I, each transmitter can form estimates wp/(t) of the other transmitters' messages via typical 
sequence decoding. 

Next we turn to the receivers. To decode the function fm(i), each receiver first decodes the resolution information 
from the signal received in block t + 1: 

L M 

ym{t + 1) = ^/^^ hmVmCi{t + 1) + \/P ^ ^ himVlm'Cl,m'{t) + 
1=1 m'^m IgB 



P'^himVlmCr,m{t)+nm{t + l). (63) 

leB 
Each receiver decodes the resolution information treating the interference — in this case the fresh information from 
each transmitter and the resolution information intended for other receivers — as noise. Each estimate Xr,m{t) is 
formed via lattice decoding as outlined in ||29l . The receivers first apply MMSE scaling to the incoming signal and 
subtract off the dither. Let 

L M 

n'^{t + 1) = y^Yl ^lrnVmCi{t + 1) + \/^ XI Zl hlmVlm'Cl^m'{t) + n„(t + 1) (64) 

1=1 m'j^m l&B 

be the sum of the interference and noise at receiver m. Then the scaled signal is 



mod A, 



y'^it + 1) = [7^(t + l)y„(t + 1) - s^(t + 1)] mod A, 

= 7m(i + 1)Vp'Y^ hlmVlmCr,m{t) + ^n^it + 1) - Sm(t + 1) 

leB 

\ l£B J 

= [Xr,m{t + 1) + n';^{t + 1)] mod A„ 



where 



n';^{t + 1) = I 7m(t + l)VP^himVlm - 1 J C^,m(i) + 7m{t + l)n^{t + 1) 

V leB J 



(65) 
(66) 

mod As (67) 
(68) 

(69) 



is the effective noise, including thermal noise, interference, and self-noise associated with MMSE scaling. Then, 
the estimate is formed by lattice quantization: 



Xr,m{t) = QA^{y'^{t + l)). 



(70) 



After decoding the resolution information, each receiver turns to ym{t) to decode the vestigial component Aj, m(0- 
First, we note that, supposing that each receiver has successfully decoded the resolution information from the 
previous block, it can subtract that portion of the interference, yielding: 



Ymii) = ym{t) - ^^himVlmCr,m{t - 1) 



leB 



VP^hlmVloCl{t) + \^ X '^hlmVlm'Cr,m'(t-l) + nm{t). 

m'j^m IgB 



(71) 



(72) 



1=1 



Furthermore, supposing that the resolution information was decoded successfully, each receiver can subtract Xr,m{t) 
from the received signal modulo the shaping lattice. Finally, in preparation for lattice decoding, we apply MMSE 
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scaling to the signal and subtract the dithers as in ||8], ||29l . Let 

be the sum of the interference and noise in ym{t)- The resulting signal is then 

L 



(73) 



mod Ao 



«m(t)ym(0 - K,mit) " ^ aimti{t) mod A., 

1=1 

L 

y^^{am{t)VPhlmVl0Cl{t) - aimtlit)) - Xr,m{t) + Qm(t)n^(i) 

.1=1 
L L 

y^ aim(Q(0 - h{t)) - K,m{t) + y^^{am{t)'/PhimVlm " aim)ci{t) + am(t)n^(t) 
.1=1 1=1 

L 
Xmit) - Xr,mit) + ^(am(i)\/^/i/m^im " aim)ci{t) + a.m(t)n^(t) 



«=1 



mod A, 



Xv,m{t) + y^^{am{t)'/PhimVlm " aim)Cl{t) + arn(t)n^(t) 



i=l 



mod A, 



[Xv,m.{t) + n'^(t)] mod A^, 



where 



n"(*) = Xl("'"(^)^^''"^'"» ~ aim)ci{t) + am(t)n^(t) 



(74) 

(75) 

mod A^ (76) 

(77) 

(78) 
(79) 

(80) 



1=1 



is the effective noise, including thermal noise, interference from other transmitters and clusters, and self-noise 
associated with MMSE scaUng. Each receiver decodes the estimate Xv,m{t) by quantizing to the nearest point in 

A„: 

Kmit) = QAAy'Lit)). (81) 

Finally, having recovered both the resolution and vestigial components, each receiver constructs its estimate of 
the desired lattice codeword, from which it can recover the desired finite-field message: 



fm(t) = <l)~\X„^{t)) = (P-^ (^ [Xr,m{t) + A^,„(t)] mod A,) . 



(82) 



Probability of error: An error occurs when (a) any of the transmitters in B fails to decode the other transmitters' 
messages, (b) any of the receivers fails to decode correctly the incoming resolution information, or (c) when any 
of the receivers fails to decode correctly the vestigial information associated with the desired lattice point. By the 
union bound, the probability of error follows 

T M 

Pe<^Yl Pr{fm(t) / fm(t)} (83) 

t=l m=l 
T T M T M 

< Y. Zl Zl^^i^^''^*) ^ ^''(*)} + Y.Y. MAr,m(t) / A,,„,(i)} + Y.Y. Pr{^-,™(*) / A,,^(i)}. 
t=l l^B V^l t=l m-1 t=l m=l 

(84) 

Here we show that as long as the rates satisfy (fT4l) . each error term in (|84] | goes to zero exponentially. We start 
with the first summation. Each transmitter decodes the messages within its cluster via typical sequence decoding 
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while treating all out-of-cluster interference as noise. By Lemma [6] the joint mutual information between the transmit 
codewords ci{t) and the receive signal z'{t) approaches that of a Gaussian multiple-access channel with channel 
coefficients giqVi'Q, transmit power P, and unit noise power. Therefore, so long as 

R < mmC7^ac(a[l : / - 1, / + 1 : L] o vo[l : / - 1, / + 1 : L], P, 1), (85) 

then Pr{w;/;(t) 7^ wp(t)} — )• exponentially for each I and /' 7^ I. 

Next we turn to the resolution information. Here each receiver decodes Ar,m(i) via lattice decoding on ym(t + 1). 
In |[29l it is shown that lattice decoding is sufficient to achieve the capacity of the Gaussian channel. From (|64l ) 
we compute that the the interference power in n^(t + 1) is 






(86) 



Similarly, we compute that the power of the resolution information in ym(i+l) is Plh^v^p. Putting these together, 
we find that if 

Rr < mill - log2 1 + ' " "' , (87) 

then Pr{Ar,m(i) / K,m{'t)} — ^ exponentially for each m. 

Finally, we look at the vestigial information. Here each receiver decodes Xv,m{t) by lattice decoding the sum 
of multiple incoming lattice points, so we borrow the main result from {8]. We compute the interference power in 
(1731) to be 

Im,v = P Yl IhmVm'P, (88) 

and the effective channel gains in (1791 ) are hm o vq. Applying these to the rate in dH), we find that if 

1, ,„„ „9 . . 1 



Rv < 



logadlPhm o voll^ + Im,v) - ^ log2 ( ||am||^ (1 + Im,v) + P ( ||am||^ ||hm o VqII"^ - |a^(hm o Vo^'^ 



(89) 
then Pr{Ai,,m(i) / K,m{i)} — >■ exponentially. 

Recall that Re = Rr + Rv and R = j^- Choosing T arbitrarily large, we obtain the desired result. ■ 

VI. Establishing the Diversity-multiplexing Tradeoff 

In this section we detail the signaling strategies that estabUsh our diversity-multiplexing claims. We begin with 

the non-cooperative case. Our approach is to choose the steering vector vq such that the effective channel vector is 

constant and has unit gain. It turns out that this approach fails approximately as often as does a single SISO link. 

Proof of Theorem [?]• First we prove the upper bound. For the non-cooperative case, it is shown in lH 

Theorem 13] that the computation capacity is upper-bounded by 

C(H, P) < max min ^ log2(l + Phfj (90) 

Ae^ l,m 2 

<ilog2(l + P/iL), (91) 

where / and m can be chosen arbitrarily such that aim 7^ 0. Then the computation capacity is bounded by the 
Shannon capacity of a single SISO Unk, which is proven in P8l to have diversity-multiplexing tradeoff d*{r) = \—r. 
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The compute-and-forward network therefore has DMT bounded by 

d*{r)<dtc{r) = l-r. (92) 

To prove the lower bound for M = 1, we invoke the non-cooperative rate of Corollary choosing vq to align 
with the channels. For multiplexing gain r, we choose a = 1 and vf = P'^~^/hf, resulting in the achievable rate 

i?(H, P) = i log2(l + LP^) - ^ log2(L) (93) 

>^log2(^"). (95) 

Outage occurs only when we cannot set vi to the specified value. Since we are constrained to have vf < 1, this 
occurs when hf < P^~^. The probability of outage is therefore 

Po<Pr\[Jhi <P''^\ <X]Pr{/if <P''-^} ^LP""-^. (96) 

U=l J 1=1 

Therefore, our scheme gives a diversity order at multiplexing gain r of 

d- (r) = lim -M^ (97) 

P^oo log(Pj 

(1 - r) log(P) - log(L) 
> lim ^ ' °\' ^^^ (98) 

P^oo log(P) 

= 1 - r. (99) 

Since this matches the upper bound, the DMT is established. ■ 

Next we prove the results for the cooperative network. First we prove the upper bound. 

Proof of Theorem^ We invoke the MISO outer bound on the computation capacity from Theorem [T] The 
symmetric -rate capacity of the MISO broadcast channel is trivially upper bounded by the capacity of the single-user 
MISO link between the source and any destination. Thus the DMT is upper-bounded by that of a single L-antenna 
MISO link, which is shown in HSl to be d*{r) = L(l — r). Thus the DMT of the cooperative compute-and-forward 
network is bounded by 

d*(r) <d+(r) =L(l-r). (100) 



Next we prove Theorem |5] for which we need first to establish an achievable rate using random codes and 
time-sharing. 

Lemma 4: Let B C {I,-"" i^}- In the cooperative compute-and-forward network with M = 1 receiver, the 
following computation rate is achievable: 

i?random(H,G,P)=min|mixiic^ac(a[l:^-l,^ + l:i],^,l),Jlog2(l + ^(hBl)')}- (101) 

Proof: The encoding scheme is simple, so we only sketch the proof. Divide the transmission into two equal 
time blocks. At the first block, each transmitter encodes and broadcasts its message using a random Gaussian 
codebook of power P. The transmitters in B decode the incoming messages using typical sequence decoding. This 
is nothing more than a Gaussian multiple-access channel, so decoding is successful as long as the rate is below the 
first term in (llOll l. The multiple-access rate is cut in half due to time sharing. 
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At the second block, the transmitters in B directly encode and broadcast the linear combination desired at the 
receiver, again using a random Gaussian codebook of power P.. The receiver decodes the desired function from 
the signal received in the second block only. This is equivalent to a MISO channel with equal beamformer weights, 
so decoding is successful as long as the rate is below the second term in (IIOII ). Again the MISO rate is cut in half 
due to time sharing. ■ 

Now we are in a position to prove Theorem |6] 

Proof of Theorem |6]- We construct an achievable scheme based on the strategy from Lemma |4l We allow B 
to vary according to the channel reaUzations, giving us the achievable rate 

i?(H, G, P) = maxmin jmin ^Cmac(a[l : I - 1,1 + 1 : L],P,l),- log2(l + P(hgl)2)| (102) 

> maxmin jmin \c^U%l[^ : I - 1,1 + I : L\,P,l),- log2(l + Pdlh^f )| . (103) 

Let each rate term in (I1031 l be denoted by i?g(H,G,P). Then define the event in which a particular cooperation 
modality fails: 

OB = {^B(H,G,P)<^log(P)}. (104) 

Outage occurs when each cooperation modality fails simultaneously: 

= [^0b (105) 

B 
L 

C fl 0{iy (106) 

1=1 

That is, we consider only the events in which a single transmitter decodes the messages. Each term in (I1031 l has 
two components, the failure of either of which results in the failure of the cooperation modality. Therefore, define 
two events: Ci, the event that transmitter / fails to decode the other transmitters' messages, and A//, the event that, 
even if transmitter I decodes successfully, the receiver fails to decode the linear function. The first event can be 
expressed as 

Cl = |^C^ac(a,P,l) < ^log(P)| (107) 



£C{1,-,L}\{0 



i,-,L}\{i} I ' ' V vec / ) 

U {Y.91i<P'^''^'-'] (109) 

i,-,L}\{0 U'e£ J 

U [^{9li<P'^'^"-'}\- (110) 

i,-,L}\{n Ki'ec ) 



£C{1,-,L}\{0 

c 



£C{1,-,L}\{0 

The second event can be expressed as 

AAz = |ilog2(l + P/.2) <^log2(P)| (111) 

Since each cooperation modality involves a different set of channel coefficients, the failure events Oi are independent. 



23 



Therefore we can bound the outage probability by 

L 

Vi{0)<\[Vt{CiUMi) (113) 



1=1 



^ n (e n pr(5?. < p^i^'^-^) +pr(/^? < p'^-')] (114) 

1=1 \ c I'ec ) 



L-1 



max pl^l(2|/:|--l) + p2r- A ^ (jjg^ 



\i<|/:|<L-i 

To find the terms with the largest error exponent, we need to find the value of |£| that maximizes the quadratics in 
(I116I ). For instance, it is clear that for r = the maximizer is |£| = 1. In general, since the quadratics in question 
are positive, the maximizer is either |£| = lor|£|=L — 1. This gives us 

Pr(0) < (max{p(^-i)(2(^-i)'-i),p2'-i| +p2r-i^^ ^^^^^ 

« (max{p(^-i)(2(^-i>-i),p2r-i|^''. (118) 

Finally, plugging dllSI ) into the definition of the DMT, we get 

log(Pr(0)) 
d (r) = hm — — -—— (119) 

P^oo log(P) 

> Lminjl - 2r, (L - 1)(1 - 2(L - l)r)}. (120) 

■ 
Next we prove the lower bound achieved by cooperative lattice coding. Tuning vq to the channel vector h 
makes it too difficult for transmitters to decode others' messages, so we take vq = P~^ , where x depends on the 
multiplexing gain. Increasing vq decreases the error probability at the transmitters, but it increases the Cauchy- 
Schwarz penalty and therefore the error probability at the receiver. For r = 0, choosing x = 1/2 is optimal, which 
gives us a diversity gain of approximately 1/2 for each transmitter. 

Proof of Theorem [T]- The proof follows a similar outline to that of Theorem |6] except that we use the rates 
proved in Theorem |3] using lattice codes. Again we allow the subset of cooperating users B to vary according to 
the channel realizations, and we choose a = 1, resulting in the following achievable rate 

P(H, G, P) = max mill j minC7mac(a[l : / - 1, / + 1 : L] o vo[l : / - 1, / + 1 : L], P, 1), 

^ log2(l + P(||ho vof + |hV|')) - \ log2 (l + p{l ||h o vof - |l^(h o vo)|2)) |. (121) 

Similar to before, we let each term in ()121l i be denoted by Pb(H, G,P) and define the events corresponding to 
the failure of each cooperation modality: 

OB = {i?s(H,G,P)<^log(P)}. (122) 
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Outage occurs when each cooperation modaUty fails simultaneously: 

= f]OB (123) 

B 

L 

c 0{i,...,z.} n fl 0{i} n O0. (124) 

1=1 

Here we consider the events in which all transmitters cooperate, in which L — 2 individual transmitters cooperate, 
and in which no one cooperates. When i3 = 0, we use the strategy outlined in the proof of Theorem 51 choosing 
vf = P^^^ /hf. Following that line of analysis, the non-cooperative modality fails only when every channel gain 
is too low: 

(r?0 C I U /li < P'-' I . (125) 

For ;B 7^ 0, we choose vio = p~^b/2 ^^^ every /, and vn = 1 — P~^'^''^ for every / G B; otherwise vu = 0. 
Using this, we can bound the rate as follows: 

i?B(H,G,P) > minj mmC^ac(gz[l : / - 1,Z + 1 : ^],i^'-"^ 1), 



ilog2(l + Pj]|/iHM-^log2(^ + P'-"'^(i||hf-||hf))]^|. (126) 



For large P, we get 



i2B(H,G,P) > min|minC7^ac(gz[l : / - 1,/ + 1 : L],pl-"^ 1) ^ logs fTT^ff^T^l I ' ^^^V) 

I 'GB Z y (^/^ _ Ij ||h|| J J 

As before we define events corresponding to the failure of either term in (I1271 i: Cb, the event that the transmitters 
in B fail to decode the other transmitters' messages, and Mb, the event that, even if the transmitters decode each 
other properly, the receiver fails to decode its linear function at the required rate. The first event can be expressed 
as 

Cb = U {Cmac(a[l : / - 1,/ + 1 : L],pl--^l) < ^log(P)} (128) 

l&B 

= U U I ^ i°g2 ( 1 + P'-^'' E al] < \ log(P) I (129) 

l£B C(1{1,- ,L}\{1] I ' ' V vec J J 

leB cc{i,- ,L}\{1] ^I'f^c ) 

l&B C(1{1,- ,L}\{L} Kl"^C ) 

For B = {1,- • • ,L}, the second event can be expressed as 



1 . / P^fi ||h|l2 



= {P^d, ,M < (L - 1)P'-} . (133) 

Based on (11331 ). we choose X|i ... /,} = r + e for any e > 0. As P — )• oo, this forces A/{i ... _/,} — )• deterministically. 
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For B = {I}, we can express the second event as 



M 



{1} 



1 / P^'oft? \ r 



hf 



(L-l)||h| 



^ pr-xiiy 



C {hf < p^-^i'}-^] u \ ||hf > 



P' 



L-1 



cf]{hf<p^~^<'^-^}uhhf>j—j 



leB 



P^ 



(134) 
(135) 
(136) 
(137) 



Combining the above with (11241 ). we get 



Oc 



.ie{ 



u u n {^- < pi^i— ^}) 

i,-,i}£c{i,--,L}\{«}«'e£ / 



n 



n 

ie{i,-,L} 



( U n {af'i < pl^l'-+^«-^} ) u {{hf < P— «-}) 



n 



j=i 



P' 



U/.f<P^-Uu ||hf>^ . (138) 



Equation (11381 ) contains too many terms to enumerate in full. Since we are concerned with asymptotic behavior, 
we need only look at the term with the highest error exponent. This term contains one channel failure in (^{i,... ,l}, 
L — 2 failures in C^iy n A/{;}, and one failure in A/'0. The final error event, in which ||h|| is too large, has negligible 
contribution to the error probability. Combining these, we get 



Pr(0) < Pr ( U n {af'i < Pl^l^'+^+^-^} ) Pr {{hi < P-i}) 

V/:c{i,-,L}\{i}«'G/: / 



nPr U n {af'i < Pl^''-+^f'>-i} U {\hi\' < P--W-} . (139) 

1=3 \cc{i,-,L}\{i}i'eC 

Since each term in (11391 ) is independent, we can evaluate the probabilities separately, yielding 



PrfO) < I y^ rp\C\r+x+e-l\\C\ j (^pr-l\ V^ /p\C\r-x-l\\C\^pr-x- 

,CC{1,- ,L}\{n J \CC{1,- ,L}\{3} 



L-2 



L-2 



P 



r-1 



E (^' 



C\i\C\r+x+e-l) 



y^ {pm{mr-x-l)\ _^ pr-x-e 



(140) 



(141) 



^£C{1,-,L}\{1} / \£C{1,-,L}\{3} 

where we have chosen x^iy = x for every I. Similar to the proof Theorem |6] the maximizer of the quadratics in 
(11411 ) is either |£| = 1 or |£| = L — 1. This gives us 



L-2 



Pr(0) < P'-i (max{p2--l+^p(l-^)(l-'■^)+(^-l>}) (maxmin {p'-+--\p(i-^)(i-(^-i>--), P^— ^}) 

(142) 
Finally, plugging (11421) into the definition of the DMT, taking the supremum over all e > 0, and taking the maximum 
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over all x yields 

.,(,■)= li^MSig" (143) 

P->oo log(P) 

> 1 - r + min{[l - 2r] + , [{L - 1)(1 - rL)] + ]+ 

max (L - 2) mm{[l -x- r] + , [{L - 1)(1 - (L - l)r - x)] + , [x - r]+}. (144) 

0<a;<l 

■ 

Although their proofs are similar, the strategies Theorems [6] and |7] achieve very different diversity-multiplexing 
tradeoffs. With random coding, transmitters decode and cooperatively transmit at separate blocks; such time division 
enables full diversity, but it leads to poor multiplexing performance. With lattice coding, on the other hand, we need 
to balance transmit power in order to ameliorate the effects of signal misalignment; the balance costs us diversity 
gain, but the multiplexing performance is improved. The overall message is that transmit cooperation improves 
diversity and multiplexing for compute-and-forward, and as we saw in Figure |2] the two approaches combined 
achieve the comer points of the DMT region. 

VII. Numerical Examples 

In this section we examine a few example scenarios in which to demonstrate the benefits of our approach. 

Example 1: The first example, depicted in Figure |5] comprises L = 2 transmitters and a single receiver. The 
channels are symmetric, with the forward coefficients constant /ii = /12 = 1 and the inter-transmitter coefficients a 
variable gi2 = 521 = 9, which we vary such that the gain g^ ranges between — lOdB and 30dB. We set the transmit 
SNR at P = lOdB. Since the channel gains are symmetric, either both transmitters can decode the other's message 
or neither of them can; therefore we choose either B = {1,2} or i3 = for cooperative computation. Similarly, 
by symmetry it is easy to see that the optimal choice for the linear function is a = (1, 1)-^ and that the optimal 
steering vectors vq and vi are constant. We find the optimal tradeoff between vq and vi numerically. 

I \ ~ - - 

■'I , ' ' 

Fig. 5. Example [T] A two-by-one computation network with symmetric channel gains. 

In Figure |6] we plot the achievable rate of our cooperative scheme against the upper bound of Theorem |2] 
using the Nazer-Gastpar rate of (l8]l as a baseline. The trends are easy to appreciate. When the channels between 
transmitters are weak, decoding each other's messages is too difficult, and the cooperative rate collapses to dH). As 
the inter-transmitter gains become stronger it becomes easier for the transmitters to decode, and cooperation can 
improve the computation rate and eventually approaches the upper bound. We note a "dimple" in the cooperative 
rate as g"^ becomes large. For sufficiently large g"^, the optimal strategy is to turn the steering vector vq down far 
enough that the Nazer-Gastpar component of the cooperative rate is zero, meaning that only the jointly-encoded 
resolution information carries information to the receiver. At this value of g^ we see the dimple, after which the 
rate quickly converges on the upper bound. 

Example 2: Next we examine a scenario in which channel gains are chosen randomly, as depicted in Figure 
I2I We place a single receiver at the origin and place L = 3 transmitters randomly and uniformly on a segment 
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Fig. 6. Achievable rates as a function of inter-transmitter channel gains for Example [T] 

of the circle having specified arclength. From the geometric configuration of the network, we compute channel 
magnitudes according to a path-loss model: 





arclength = vr arclength = 7r/2 

Fig. 7. Example |2l Three users are placed along a segment of the unit circle, while the receiver is placed at the origin. 



9ij 



1 



d{i,j) 



hi 



1 



(i(i,0)"' 



where d{i,j) is the Euclidean distance between users i and j and. We choose P = lOdB and a path-loss exponent 
of a = 4. 

For each realization we calculate the cooperative computation rate. Since the gains from transmitters to receiver 
are equal, a = (1,1,1)^ is the optimal choice. The steering vectors and the clusters are optimized numerically. 
We run 500 simulations each for arclengths varying from to vr, and plot the average computation rates in Figure 
[8] Again the trends are easy to appreciate. Cooperation offers the greatest improvement when transmitters are 
close together. Even as we spread transmitters further apart, on average enough transmitters can cooperate that our 
approach gamers a noticeable improvement. 

Example 3: In the next example we examine the variation in cooperative computation rate with respect to the 
channel gain between transmitters and receivers. As depicted in Figure |9] we again have L = 2 and M = 1, but 
now we set every channel gain to unity except for hi2. Since the channels between transmitters and receiver are not 
symmetric, we cannot take a = (1, 1)^ or vq and vi to be constant. Instead, we iterate manually through possible 
choices of a and numerically optimize over the set B of cooperating transmitters and the steering vectors vq and 
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Fig. 8. Average computation rate vs. angle between transmitters for Example |2] 
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Fig. 9. Example [3] A two-by-one computation network with asymmetric channel gains. 



In Figure [To] we plot the cooperative rate alongside dU for a variety of transmit signal-to-noise ratios P. We make 
a few observations. First, the non-cooperative rate is low for /i2i near to zero. Since we require the function to 
contain elements from both transmitters' messages, it becomes difficult for the receiver to decode such a function. 
In the cooperative case, however, the rates do not fall, since transmitter 1 can decode W2 and transmit the desired 
function to the receiver. This result hints at the diversity gains inherent to the cooperative approach; even when 
one link fails, successful computation is possible. 

Furthermore, in the cooperative case we get the full multiplexing gain as the SNR becomes large. In the non- 
cooperative case we observe "peaks"; these correspond to rational channel gains with low denominator. The further 
/i2i is from a low-denominator rational, the harder it is to align the function with the channels and the higher 
the Cauchy-Schwarz penalty in dSjl. However, we can always choose vq such that the equivalent channel vector is 
rational, allowing us to completely eliminate the Cauchy-Schwarz penalty. We note that this is not explicitly due 
to the cooperative nature of our approach; as shown in |4] non-cooperative transmitters can get the full multiplexing 
gain using lattice codes. However, our cooperative approach does permit the transmitters to use the remaining power 
to secure rate and diversity gains. 

Example 4: Finally, we examine the system depicted in Figure [H] Here we have L = M = 2, and again we set 
all channel gains to unity except for /121. Again asymmetry prevents us from choosing a and the steering vectors 
easily. We iterate manually over the possible choices for a, choose zero-forcing beamformers for vi and V2, and 
numerically optimize over vq. In order for zero-forcing to succeed, we choose B = {1,2}. 

In Figure [12] we plot the cooperative rate alongside ((8), again for a variety of signal-to-noise ratios. Again we 
make a few observations. In contrast to the previous scenario, here the rate drops when hi2 ~ 1; this is because 
the channel matrix becomes increasingly ill-conditioned. Similar to before, in the cooperative case the rate remains 
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Fig. 10. Achievable rates as a function of /121 and P for Example |3] 

non-zero, but here it occurs because the transmitters can cooperatively send a full-rank set of equations even though 
the channel matrix is nearly singular. However, in this example cooperation does not obtain the full multiplexing 
gain. The freedom to choose vq allows us to mitigate the peakiness of the achievable rate, but we cannot eliminate 
the Cauchy-Schwarz penalty at both receivers simultaneously. Even for high SNR, however, we do get considerable 
robustness to channel variation. 



VIII. Conclusion 

We have studied the impact of user cooperation on compute-and-forward. Constructing a lattice-coding version of 
block Markov encoding, we presented a strategy that introduces a "decode-and-forward" element into computation 
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Fig. 11. Example |4] A two-by-two computation network with asymmetric channel gains. 

coding. Transmitters decode each other's messages, enabling them to transmit resolution information cooperatively to 
the receivers. Our strategy achieves higher computation rates than previous approaches, since transmitters can jointly 
encode part of their messages, and coherent signals benefit from a beamforming gain. Additionally, cooperation 
enables an improvement in the diversity-multiplexing tradeoff, achieving full diversity when there is a single receiver. 

In the case of multiple receivers, however, we have not established an achievable diversity-multiplexing tradeoff. 
The difficulty of aligning lattice codewords at multiple receivers suggests that lattice coding is insufficient for the 
task. A promising approach may be to introduce a cooperative element into the signal-alignment strategy of [12]. 
Since this approach achieves the full multiplexing gain for multiple receivers, we expect to be able to obtain a 
non-trivial characterization of the DMT regardless of the number of receivers. 

Finally, we note that our techniques can be applied to any situation in which one needs to merge lattice codes 
with decode-and-forward style cooperation. Our block Markov approach is rather general; as mentioned earlier, it 
can be used to achieve the capacity of the physically degraded relay channel or to achieve the decode-and-forward 
rates of the cooperative multiple-access channel. We therefore expect our techniques to be useful for developing 
new strategies and estabhshing new results in areas where lattice codes and cooperation are applied, such as 
physical-layer security Il55l - ll59l and interference channels |[60l - ll62l . 

Appendix A 
Proofs of Upper Bounds 

Our first task is to prove Theorem [T] for which we need a quick lemma. 

Lemma 5: Let wi,--- ,Wi G F^ be independently and uniformly distributed messages. Then, the functions 
fi, • • • , f M are also independent and uniformly distributed across F^. 

Proof: Since the finite-field linear combinations in f/ are taken element-wise, it is sufficient to show the 
result for an arbitrary element of both messages and functions. Therefore, let w = [wn,--- ,'Wli)'^ and f = 
(/ii, • • • , /a/i)^ = Aw. We need to show that the elements of f are independent and uniformly distributed. 

Since w is uniformly distributed over F^ , its probability mass function is 

p(w) = p-^. (145) 

The conditional pmf of f is 

p(f|w) = <5(f- Aw), (146) 
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Fig. 12. Achievable rates as a function of /121 and P for Example |4] 



where (5(-) is the Kronecker delta function. Next we compute the marginal pmf for f : 

p(f) = ^ p(f|w)p(w) 

E ^(f-Aw) 



w6F^ 
L 



P 



weF^ 



p {w|Aw = f} 



p'LpL-M 



P 



^M 



(147) 

(148) 

(149) 
(150) 
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where (|149l l follows because A is full rank. Since the pmf p(f ) does not depend on f , the elements are independent 
and uniformly distributed. ■ 

With Lemma |5j it is straightforward to prove Theorem [T] 

Proof of Theorem^! Suppose that a genie provides the messages w;(t) to each of the transmitters. Then the 
transmitters each can compute the functions fm(i)- By Lemma [5] these functions are independent and uniformly 
distributed, the scenario is equivalent to an L-transmitter antenna having M independent messages to send to M 
users. In fV^ the capacity region is shown to be (ITOl ). Since we define the computation capacity in terms of 
achievable symmetric rate, it cannot exceed the symmetric -rate MISO capacity given in (IT2l ). ■ 

Next we prove the upper bound in Theorem |2] 

Proof of Theorem |2} Choose a transmitter /, and suppose that a genie supplies the messages wi (t) to the 
receivers for every /' 7^ /. By the crypto lemma [29|, each fm(t) such that aim / is statistically independent of 
the messages wi'{t), so the receivers remain equivocal as to the desired functions. Thus the scenario is equivalent 
to a compound relay channel, with transmitter / acting as the source, the transmitters I' acting as relays, and each 
receiver m such that aim / acting as destinations all needing the messages wi{t). The capacity of the compound 
relay channel can be bounded using cut-set arguments. For any cut S & Si, the capacity of the compound relay 
channel, and thus the computation capacity of the cooperative compute-and-forward network, is bounded by 

C(H,G,P) < max min I{xuxs]ym,zsc\xsc) (151) 

p(x) m,ai„^^0 

< mill m.ay.l{xi,xs;ym,zsc\xsc). (152) 

Taking the minimum over all transmitters and all cuts S, we obtain the result. ■ 

Appendix B 
Entropy of dithered lattices over the multiple-access channel 

Here we prove that the mutual information between dithered lattice codewords and any receiver approaches that 
of a Gaussian multiple-access channel. 
Lemma 6: Let 

x/ = ^/P[\i + ti] mod A, (153) 

be a collection of independent lattice codewords, dithered across the shaping lattice, for 1 < / < L. Let 

L 

y = Y^ hm + n, (154) 

1=1 

be a noisy sum of the codewords, where the noise n has i.i.d. elements with variance o"^. Then, for any set 
;B e {1, • • • , L}, the normalized mutual information between the transmit signals and the receive signal approaches 
at least that of a Gaussian multiple-access channel: 

lim il(xe; ylx^jc) > \ log2 (l + l^dfh\ . (I55) 

When n is Gaussian, this bound is tight. 

Proof: Since y is the sum of transmitted signals, conditioning entails only subtracting away the known 
component. Therefore, letting 
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the mutual information is 

lim -I{xB;y\xBc) = lim -/(xe;ye)= lim -(/i(ye) - /i(n)), (157) 

n— ^oo n n^foo n n— >-oo n 

where h{-) is the differential entropy. Since the Gaussian distribution maximizes the differential entropy for a given 

variance, we have 

-/i(n) < -log(27recj2). (158) 

n 2 

To bound h(yt3)^ we note that in fSj Lemma 8] it was shown that the density function /y^ is bounded by 

/y, <e^(")"/y., (159) 

where y* is an i.i.d. Gaussian vector with variance PYl,i(^B^'i + '^^' ^^^ '^('^) ^^ ^ '■^'^"^ approaching zero from 
above as n — ;• oo. Plugging this into the definition of differential entropy, we have, for sufficiently high n, 

-Kyb) > -- /e^(")"/y. log(e^(")"/y.) (160) 

n J n 

= e<^)^(U{y*)-c{n)] (162) 

>-h{y*)-c{n) (163) 

n 

-^ -h{y*) (164) 

n 

= ^log 1 27re I P^+aAy (165) 

where (11631 ) follows because e'^^"^" > 1 and for sufficiently high n the term ^h{y*) — c{n) is positive. Combining 
(11581 ) and (11651 ). we get that 

lim -I{^B;y\^Bc) > l\og l2TTe I Py^hf + aA] - }-\og{2Trea^) (166) 



^-(-^^^'^ 



log2 1 + ""'^" ' . (167) 



When n is Gaussian, it is well-known that Gaussian inputs are optimal and result in the same mutual information 
as the bounds just established. In this case the bound is tight. ■ 
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